A Multi-instance Multi-label Dual Learning Approach for Video Captioning

Abstract

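Video captioning is a challenging task in the field of multimedia processing, which aims to generate informative natural language descriptions/captions to describe video contents. Previous video captioning approaches mainly focused on capturing visual information in videos using an encoder-decoder structure to generate video captions. Recently, a new encoder-decoder-reconstructor structure was proposed for video captioning, which captured the information in both videos and captions. Based on this, this article proposes a novel multi-instance multi-label dual learning approach (MIMLDL) to generate video captions based on the encoder-decoder-reconstructor structure. Specifically, MIMLDL contains two modules: caption generation and video reconstruction modules. The caption generation module utilizes a lexical fully convolutional neural network (Lexical FCN) with a weakly supervised mechanism to learn a translatable mapping between video regions and lexical labels to generate video captions. Then the video reconstruction module synthesizes visual sequences to reproduce raw videos using the outputs of the caption generation module. A dual learning mechanism fine-tunes the two modules according to the gap between the raw and the reproduced videos. Thus, our approach can minimize the semantic gap between raw videos and the generated captions by minimizing the differences between the reproduced and the raw visual sequences. Experimental results on a benchmark dataset demonstrate that MIMLDL can improve the accuracy of video captioning.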
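The dual learning idea in the abstract, fine-tuning both modules according to the gap between raw and reproduced videos, can be illustrated with a toy objective. The sketch below is not the authors' formulation: the function names, the negative log-likelihood caption loss, the mean squared reconstruction gap, and the `weight` parameter are all assumptions chosen for illustration.

```python
import math
from typing import List, Sequence


def caption_nll(word_probs: Sequence[float]) -> float:
    """Negative log-likelihood of a reference caption.

    word_probs holds the probability the (hypothetical) caption module
    assigned to each reference word.
    """
    return -sum(math.log(p) for p in word_probs)


def reconstruction_gap(raw: List[List[float]],
                       reproduced: List[List[float]]) -> float:
    """Mean squared Euclidean distance between per-frame feature vectors.

    This stands in for the gap between raw and reproduced visual
    sequences; the actual distance used in the paper is not given here.
    """
    if len(raw) != len(reproduced):
        raise ValueError("sequences must have the same number of frames")
    total = 0.0
    for raw_frame, rep_frame in zip(raw, reproduced):
        total += sum((a - b) ** 2 for a, b in zip(raw_frame, rep_frame))
    return total / len(raw)


def dual_objective(word_probs: Sequence[float],
                   raw: List[List[float]],
                   reproduced: List[List[float]],
                   weight: float = 0.5) -> float:
    """Caption loss plus a weighted reconstruction gap (assumed form)."""
    return caption_nll(word_probs) + weight * reconstruction_gap(raw, reproduced)
```

Under these assumptions, lowering the reconstruction term pushes the generated caption to retain enough information to reproduce the visual sequence, which is the intuition the abstract gives for narrowing the semantic gap.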

Similar resources

Multi-instance multi-label learning

In this paper, we propose the MIML (Multi-Instance Multi-Label learning) framework where an example is described by multiple instances and associated with multiple class labels. Compared to traditional learning frameworks, the MIML framework is more convenient and natural for representing complicated objects which have multiple semantic meanings. To learn from MIML examples, we propose the Miml...

Multi-instance multi-label active learning

Multi-instance multi-label learning (MIML) has achieved success in various applications, especially those involving complicated learning objects. As expressive power increases, the cost of annotating an MIML example also increases significantly. In this paper, we propose a novel active learning approach to reduce the labeling cost of MIML. The approach actively queries the most valu...

Learnability of Multi-Instance Multi-Label Learning

Multi-Instance Multi-Label learning (MIML) is a new machine learning framework where one data object is described by multiple instances and associated with multiple class labels. During the past few years, many MIML algorithms have been developed and many applications have been described. However, theoretical exploration of the learnability of MIML is lacking. In this paper, through proving a ...

Fast Multi-Instance Multi-Label Learning

In multi-instance multi-label learning (MIML), one object is represented by multiple instances and simultaneously associated with multiple labels. Existing MIML approaches have been found useful in many applications; however, most of them can only handle moderate-sized data. To efficiently handle large data sets, we propose the MIMLfast approach, which first constructs a low-dimensional subspace...

A Framework of Hashing for Multi-instance Multi-label Learning

Multi-instance multi-label learning (Miml) is a powerful framework for problems in which each example is represented as multiple instances and associated with multiple class labels. Previous works mostly focus on accuracy, while scalability to large-scale datasets has rarely been addressed. In this paper, we present a novel framework – Multi-instance Multi-label Hashing (MimlH) to...
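The MIML setting shared by the abstracts above, in which one example is a bag of instances paired with a set of class labels, can be sketched as a small data structure. The names `MIMLExample`, `label_vocabulary`, and `to_indicator` are hypothetical and not taken from any of the listed papers.

```python
from dataclasses import dataclass
from typing import FrozenSet, List, Sequence, Tuple


@dataclass(frozen=True)
class MIMLExample:
    """One MIML example: several instance vectors, several labels."""
    instances: Tuple[Tuple[float, ...], ...]  # e.g. one vector per image region
    labels: FrozenSet[str]                    # all labels that apply to the bag


def label_vocabulary(examples: Sequence[MIMLExample]) -> List[str]:
    """Sorted list of every label seen across the examples."""
    vocab = set()
    for example in examples:
        vocab.update(example.labels)
    return sorted(vocab)


def to_indicator(example: MIMLExample, vocab: Sequence[str]) -> List[int]:
    """Multi-label target as a 0/1 vector aligned with vocab."""
    return [1 if label in example.labels else 0 for label in vocab]


# Usage: a bag with two instances and two labels.
bag = MIMLExample(
    instances=((0.1, 0.2), (0.3, 0.4)),
    labels=frozenset({"dog", "grass"}),
)
other = MIMLExample(instances=((1.0, 0.0),), labels=frozenset({"sky"}))
vocab = label_vocabulary([bag, other])
target = to_indicator(bag, vocab)
```

Note that the labels attach to the bag as a whole, not to individual instances. This is why MIML is a weakly supervised setting: which instance gives rise to which label is not given.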

Journal

Journal title: ACM Transactions on Multimedia Computing, Communications, and Applications

Year: 2021

ISSN: 1551-6857, 1551-6865

DOI: https://doi.org/10.1145/3446792